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Abstract. We investigate families of infinite automata for context-sensitive languages. 
An infinite automaton is an infinite labeled graph with two sets of initial and final vertices. 
Its language is the set of all words labelling a path from an initial vertex to a final vertex. 
In 2001, Morvan and Stirling proved that rational graphs accept the context-sensitive 
languages between rational sets of initial and final vertices. This result was later extended 
to sub-families of rational graphs defined by more restricted classes of transducers. 

Our contribution is to provide syntactical and self-contained proofs of the above re- 
sults, when earlier constructions relied on a non-trivial normal form of context-sensitive 
grammars defined by Penttonen in the 1970's. These new proof techniques enable us to 
summarize and refine these results by considering several sub-families defined by restric- 
tions on the type of transducers, the degree of the graph or the size of the set of initial 
vertices. 



One of the cornerstones of formal language theory is the well-known hierarchy intro- 
duced by Chomsky in |Cho59j . It consists of the regular, context-free, context-sensitive 
and recursively enumerable languages. This hierarchy was originally defined by imposing 
syntactical restrictions on the rules of grammars generating the languages. These four fam- 
ilies of languages as well as some of their sub-families have been extensively studied. In 
particular, they were given alternative characterizations in terms of finite acceptors. They 
are respectively accepted by finite automata, pushdown automata, linearly bounded au- 
tomata and Turing machines. Recently, these families of languages have been characterised 
by families of infinite automata. An infinite automaton is a labelled countable graph to- 
gether with a set of initial and a set of final vertices. The language it accepts (or simply 
its language) is the set of all words labelling a path from an initial vertex to a final vertex. 
In |CK02| . a summary of four families of graphs corresponding to the four families in the 
Chomsky hierarchy was given: they are respectively the finite graphs, prefix-recognisable 
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graphs |Cau96| ICauf)3aj , rational graphs |MorOflj and transition graphs of Turing machines 
|Cau03b| (for a survey, see for instance |Tho01| ). 

This work specifically deals with a family of infinite automata for context-sensitive lan- 
guages. The first result on this topic is due to Morvan and Stirling |MS01| . who showed 
that the languages accepted by rational graphs, whose vertices are words and whose edges 
are defined by rational transducers, taken between rational or finite sets of vertices, are pre- 
cisely the context-sensitive languages. This result was later extended by Rispal |Ris02| to the 
more restricted families of synchronized rational graphs, and even to synchronous graphs. A 
summary can be found in |MR04j . All proofs provided in these works use context-sensitive 
grammars in Penttonen normal form |Pen74j to characterize context-sensitive languages, 
which has two main drawbacks. First, this normal form is far from being obvious, and the 
proofs and constructions provided in |Pen74j are known to be difficult. Second, and more 
importantly, there is no grammar-based characterization of deterministic context-sensitive 
languages, which forbids one to adapt these results to the deterministic case. 

Our main contributions are a new syntactical proof of the theorem by Stirling and 
Morvan based on the thight correspondance between tiling systems and synchronized graphs 
and an in depth study of the trade off between the structure of the rational graphs (number 
of initial vertices and out-degree) , the transducers defining them, and the family of languages 
they accept, as summarized in Tabled 

Each row of the table concerns a family or sub-family of rational graphs, and each 
column corresponds to a structural restriction of that family with respect to sets of initial 
vertices and degree. The first case is that of rational (infinite) sets of initial vertices, while 
the second case only considers the fixed rational initial set {a}* over a single letter a. The 
two remaining cases concern graphs with a unique initial vertex, with respectively arbitrary 
and finite out-degree. A cell containing an equality symbol indicates that the languages 
accepted by the considered family of graphs (row) from the considered set of initial vertices 
(column) are the context-sensitive languages. An inclusion symbol indicates that their 
languages are strictly included in context-sensitive languages. A question mark denotes a 
conjecture. When relevant, we give a reference to the proposition, theorem or remark which 
states each result. 



Family of graphs 


Set of initial vertices 


Rational set 


Set {a}* 


Unique vertex 


Unique vertex 
(finite degree) 


Rational |MSnij 








= WM 


Synchronized |Bis02| 






= m 


c (?) |4.1()| 


Synchronous |Ris02j 




= m 




c 


Sequential synchronous 




c (?) a 


c 


c 



Table 1: Families of rational graphs and their languages. 



Finally, we investigate the case of deterministic languages. A long-standing open prob- 
lem in language theory is the equivalence between deterministic and non-deterministic (or 
even unambiguous) context-sensitive languages |Kur64j . Thanks to our constructions, we 
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characterize two syntactical sub-families of rational graphs respectively accepting the un- 
ambiguous and deterministic context-sensitive languages. 

Outline. Our presentation is structured along the following lines. The definitions of 
rational graphs and context-sensitive languages are given in Section [2 The results concern- 
ing languages accepted by rational and synchronous rational graphs are given in Section |3 
In Section |IJ we investigate rational graphs under structural constraints, and finally Section 
El is devoted to deterministic context-sensitive languages. 

2. Definitions 

2.1. Notations 

Before all, we fix notations for words, languages and automata, as well as directed 
graphs and the languages they accept. For a more thorough introduction to monoids and 
rationality, the interested reader is referred to |Ber79| ISak03j . 

2.1.1. Languages and Automata 

We consider finite sets of symbols, or letters, called alphabets. In the following, £ and 
r always denote finite alphabets. Tuples of letters are called words, and sets of words 
languages. The word u corresponding to the tuple (u±, . . . , u n ) is written u\ . . . u n . Its i-th 
letter is denoted by u(i) = U{. The set of all words over E is written E*. The number 
of letter occurrences of u is its length, written \u\ = n. The unique word of length is 
written e. The concatenation of two words u = u\ . . .u n and v = v\ . . . v m is the word 
uv = u\ . . .u n vi . . .v m . This operation extends to sets of words: for all A, B C E*, AB 
stands for the set {uv \ u € A and v € B}. By a slight abuse of notation, we will usually 
denote by u both the word u and the singleton {u}. 

A monoid is composed of a set M together with an associative internal binary law on 
M called product, with a neutral element in M. The product of two elements x and y of 
M is written x ■ y. An automaton over M is a tuple A = [L, Q, qo, F, 5) where L C M is a 
finite set of labels, Q a finite set of control states, qo € Q is the initial state, F C Q is the 
set of final states and iCQxixQis the transition relation of A. 

A run of A is a sequence of transitions (qo,h,Qi) ■ ■ ■ (in-i. In, Qn)- It is associated to 
the element m = l\ ■ . . . ■ l n € M. If q n belongs to F, the run is accepting (or successful), 
and m is accepted, or recognized, by A. The set of elements accepted by A is written L{A). 
A is unambiguous if there is only one accepting run for each element in L(A). 

The star of a set X C M is defined as X* := U fc >o^ fc with x ° = i e } and xk+1 = 
X ■ X k . Similarly, we write X + := [J k>l X k . The set of rational subsets of a monoid is 
the smallest set containing all finite subsets and closed under union, product and star. The 
set of all words over E together with the concatenation operation forms the so-called free 
monoid whose neutral element is the empty word e. Finite automata over the free monoid 
E* are known to accept the rational subsets of E*, also called rational languages. 
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2.1.2. Graphs 

A labeled, directed and simple graph is a set G C V x V x V where T is a finite 
set of labels and V a countable set of vertices. An element (s, a, t) of G is an edge of 
source s, target t and /a&e/ a, and is written s -A i or simply s — — > t if G is understood. 

The set of all sources and targets of a graph form its support Vq. A sequence of edges 
s\ —U t\, . . . ,Sk — ^ ifc with Vi € [2, /c], Sj = is called a pai/i. It is written si — A ifc, 
where u = a\ . . .a^ is the corresponding pai/i Za&eZ. A graph is deterministic if it contains 
no pair of edges having the same source and label. The path language of a graph G between 
two sets of vertices / and F is the set 

L{G,I,F) := {w | s At, s G /, t G F}. 

If two infinite automata recognize the same language, we say they are trace-equivalent. In 
this paper, we consider infinite automata: infinite graphs together with sets of initial and 
final vertices. We will no longer distinguish the notion of graph with initial and final vertices 
from the notion of automaton. However, as we will see in Section |31 with no restriction 
on the set of initial vertices and on the structure of the graph this might not provide a 
reasonable extension of finite automata. 

2.2. Word transducers 

Automata can be used to accept more than languages. In particular, when the edges of 
an automaton are labelled with pairs of letters (with an appropriate product operation) , its 
language is a set of pairs of words, which can be seen as a binary relation on words. Such 
automata are called finite automata with output, or transducers, and they recognize rational 
relations. We will now recall their definition as well as some of their important properties. 
For a detailed presentation of transducers, see for instance |Ber79llP7i00llSak03j . 

Consider the monoid whose elements are the pairs of words (u,v) in £*, and whose 
composition law is defined by (ui,v\) ■ (^2,^2) = (^1^2, ^1^2), generally called the product 
monoid and written S* x S*. A transducer T over a finite alphabet £ is a finite automaton 
over S* x £* with labels in (S U {e}) x (S U {s}). Finite transducers accept the rational 
subsets of £* x £*. We do not distinguish a transducer from the relation it accepts and 
write {w,w') G T if (w,w') is accepted by T. The domain Dom(T) (resp. range Ran(T)) 
of a transducer T is the set {w \ (w,w') G T} (resp. {w' \ (w,w') G T}). We also write 
T{L) the set of all vertices v such that (u, v ) G T for some u G L. A transducer accepting 
a function is called functional. 

In general, there is no bound on the size difference between input and output in a 
transducer. Interesting subclasses are obtained by enforcing some form of synchronization. 
For instance, length-preserving rational relations are recognized by transducers with labels 
in S x S, called synchronous transducers. Such relations only pair words of the same size. 

A more relaxed form of synchronization was introduced by Elgot and Mezei |EM65| : a 
transducer over E with initial state qo is left-synchronized if for every path 

qo — ► qi ■ --qn-i — ► q n , 

there exists k G [0,n] such that for all i G [0, k], Xi and belong to S and either Xj = e 
for all j > k or yj = e for all j > k. In other terms, a left-synchronized relation is a finite 
union of relations of the form S ■ F where S is a synchronous relation and F is either equal 
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Figure 1: The grid and its associated transducers. 



to {e} x R or R x {e} where R is a rational language. Right- synchronized transducers are 
defined similarly. In the following, unless otherwise stated, we will refer to left-synchronized 
transducers simply as synchronized transducers. 

The standard notion of determinism for automata does not have much meaning in the 
case of transducers because it does not rely only on the input but on both the input and 
the output. A more refined notion is that of sequentiality: a transducer T with states Q is 



x/y 



x'/y' 



sequential if for all q, q' and q" in Q, if q — > q' and q — ► q" then either x 



x 



y = y' and 



q' = q", or x ^ e, x' ^ e and x ^ x' . 

Remark 2.1. The standard determinization procedure applied to a synchronous transducer 
yields an equivalent unambiguous synchronous transducer (i.e for every pair of words (u, v) 
accepted by the transducer there is exactly one accepting run of the transducer labelled by 
u/v). This remains true for synchronized transducers. 

It is well-known that there is a close relationship between rational languages and rational 
transductions. In particular, rational relations have rational domains and ranges, and are 
closed under restriction to a rational domain or range. Moreover, the restriction of a 
sequential (resp. synchronous) transducer to a rational domain is still sequential (resp. 
synchronous) (see for instance |Ber79j ). 



2.3. Rational graphs 

The Chomsky-like hierarchy of graphs presented in |CK02| uses words to represent 
vertices. Each of these graphs is thus a finite union of binary relations on words, each 
relation corresponding to a given edge label. In particular, the family of rational graphs 
owes its name to the fact that their sets of edges are given by rational relations on words, 
i.e. relations recognized by word transducers. 

Definition 2.2 ( |Mor00j ). A rational graph labelled by £ with vertices in T* is given 
by a tuple of transducers (T a ) ae s over T. For all a S S, G has an edge labelled by a 
between vertices u and v 6 T* if and only if (u,v) € T a . For w G S + and a £ S, we write 
T wa = T w o T a , and u — > v if and only if (tt, v) S T w . 

Note that T w oT a stands for the set of all pairs (u, v) such that (u, x) € T w and (x, v) € T a 
for some x. Figure^shows an example of rational graph, the infinite grid, with the rational 
transducers which define its edges. By the properties of rational relations, the support of a 
rational graph is a rational subset of T* . The rational graphs with synchronized transducers 
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were already defined by Blumensath and Grdel in |BG00| under the name automatic graphs 
and by Rispal in |Ris02j under the name synchronized rational graphs. 

It follows from the definitions (see Section |2.2[) that sequential synchronous, synchro- 
nous, synchronized and rational graphs form an increasing hierarchy. This hierarchy is 
strict (up to isomorphism): first, sequential synchronous graphs are deterministic graphs 
whereas synchronous graphs can be non-deterministic. Second, synchronous graphs have a 
finite degree whereas synchronized graphs can have an infinite degree. Finally, to separate 
synchronized graphs from rational graphs, we can use the following properties on the growth 
rate of the out-degree in the case of graphs of finite out-degree. 

Proposition 2.3. |Morfllj For any rational graph G of finite out-degree and any vertex x, 
there exists c£N, such that the out-degree of vertices at distance n of x is at most c c . 

This upper bound can be reached: consider the unlabeled rational graph Go = {T} 
where T is the transducer over V = {A, B} with one state qo which is both initial and final 

and a transition qo qo for all X, Y and Z £ T. It has an out-degree of 2 2 ™ +1 at distance 

n of A. In the case of synchronized graphs of finite out-degree, the bound on the out-degree 
is simply exponential. 

Proposition 2.4. |Ris02j For any synchronized graph G of finite out-degree and vertex x, 
there exists c £ N such that the out-degree of vertices at distance n > of x is at most c n . 

It follows from the above proposition that Go is rational but not synchronized. Hence, 
the synchronized graphs form a strict sub-family of rational graphs. 

2.4. Context sensitive languages 

In this work, we are concerned with the family of context-sensitive languages 1 . Several 
finite formalisms are known to accept this family of languages, the most common being 
linearly bounded machines (LBM), which are Turing machines working in linear space. 
Less well-known acceptors for these languages are bounded tiling systems, which are not 
traditionally studied as language recognizers. However, one can show that these formalisms 
are equivalent, and that syntactical translations exist between them. Since they are at the 
heart of our proof techniques, we now give a detailed definition of tiling systems. For more 
information about linearly bounded machines the reader is referred to |HU79j . 

Tiling systems were originally defined to recognize or specify picture languages, i.e. two- 
dimensional words on finite alphabets |GR96j . They can be seen as a normalized form of 
dominos systems |LS97bj . Such sets of pictures are called local picture languages. However, 
by only looking at the words contained in the first row of each picture of a local picture 
language, one obtains a context-sensitive language |LS97aj . 

A (n, m)-picture p over an alphabet T is a two dimensional array of letters in T with 
n rows and m columns. We denote by p(i,j) the letter occurring in the ith row and jth 
column starting from the top-left corner, by T n ' m the set of (n, m)-pictures and by T** the 
set of all pictures 2 . Given a (n, m)-picture p over T and a letter # T, we denote by p # 
the (n + 2, m + 2)-picture over T U {#} defined by: 
• P#(i,l) =P#{i,m + 2) = # for i G [l,n + 2], 

4n order to simplify our presentation, we only consider context-sensitive languages that do not contain 
the empty word e (this is a standard restriction). 
2 We do not consider the empty picture. 
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Figure 2: A tiling system accepting a n b n (Cf. Example 12. 7(1 . 



• P#(!,i) =P#(n + 2,j) =# for j € [l,m + 2], 

• p#(i + 1, J + 1) = p{i,j) for i € [1, n] and j G [1, m]. 

For any n, m > 2 and any (n, m)-picture p, T(p) is the set of (2, 2)-pictures appearing in 
p. A (2, 2)-picture is also called a tile. A picture language K C T** is /oca? if there exists 
a symbol # ^ T and a finite set of tiles A such that K = {p € T** | T(p#) C A}. To 
any set of pictures over T, we can associate a language of words by looking at the frontiers 
of the pictures. The frontier of a (n, m)-picture p is the word fr(p) = 1) . . . p(l, m) 
corresponding to the first row of the picture. 

Definition 2.5. A tiling system S is a tuple (r, S, #, A) where T is a finite alphabet, S C T 
is the input alphabet, # T is a frame symbol and A is a finite set of tiles over T U {#}. It 
recognizes the local picture language P(S) = {p £ T** | T(p # ) C A} and the word language 
L(5) =fr(P(5))nS*. 

A tiling system S 1 recognizes a language L C S + in height /(n) for some mapping 
/ : N I— ► N if for all w 6 L(S) there exists a (n, m)-picture p in P(S) such that iu = fr (p) 
and n < f(m). We can now precisely state the following well-known equivalence result. 

Theorem 2.6. The following simulations link linearly bounded machines and tiling systems: 

(1) A linearly bounded Turing machine T working in f(n) reversals can be simulated by 
a tiling system of height f(n) + 2. 

(2) A tiling system of height f{n) can be simulated by a linearly bounded Turing machine 
working in f(n) reversals. 

Example 2.7. Figure El shows the set of tiles A of a tiling system S over T = {a,b, _L}, 
X = {a, 6} and the border symbol #. The language L(S) is exactly the set {a n b n \ n > 1}. 

A context-sensitive language is called deterministic if it can be accepted by a determin- 
istic LBM or tiling system, where a tiling system is deterministic if one can infer from each 
row in a picture a single possible next row. 
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Figure 3: Each edge represents an effective transformation preserving languages. 



3. The languages of rational graphs 

In this section, we consider the languages accepted by rational graphs and their sub- 
families from and to a rational set of vertices. We give a simplified presentation of the 
result by Morvan and Stirling [MSOlj stating that the family of rational graphs accepts 
the context-sensitive languages. This is done in several steps. First, Proposition 13.21 states 
that the rational graphs are trace-equivalent to the synchronous rational graphs. Then, 
Proposition 13.31 and Proposition 13 . 71 establish a very tight relationship between synchronous 
graphs and tiling systems. It follows that the languages of synchronous rational graphs are 
also the context-sensitive languages (Theorem 13. 9|) . The original result is given as Corollary 
13.101 Finally, Proposition 13.111 establishes that even the smallest sub-family we consider, 
the family of sequential synchronous rational graphs, accepts all context-sensitive languages. 
The various transformations presented in this section are summarized in Figure El 

3.1. From rational graphs to synchronous graphs 

We present an effective construction that transforms a rational graph G with two ra- 
tional sets / and F of initial and final vertices into a synchronous graph G' trace-equivalent 
between two rational sets I' and F' . The construction is based on replacing the symbol e 
in the transitions of the transducers defining G by a fresh symbol #. 

Let (T a ) ae s be the set of transducers over Y characterizing G and let # be a symbol 
not in T. For all a, we define a to be equal to a if a £ Y, and to e if a = #. We extend 
this to a projection from (r U #)* to Y* in the standard way. We define G' as the rational 
graph defined by the set of transducers (T^) aG s where T' a has the same set of control states 
Q a as T a and a set of transitions given by 

\P — ► q\p — ► Q € T a ) U {p — >p\p £Qa\- 

By definition of each T' a , G' is a synchronous rational graph. Let V and F' be the two 
rational sets such that I' = {u | u G 1} and F' = {v \ v € F} (the automaton accepting /' 
(resp. F') is obtained from the automaton accepting / (resp. F) by adding a loop labeled 
by # on each control state). We claim that G' accepts between I' and F' the same language 
as G between / and F. For example, Figure 0] illustrates the previous construction applied 
to the graph of Figure H Only one connected component of the obtained graph is shown. 

Before we prove the correctness of this construction, we need to establish a couple 
of technical lemmas. Let B be the set of all mappings from N to N. To any mapping 
5 € £>, we associate a mapping from (r U {#})* to (r U {#})* defined as follows: for 
all w = # i(J ai# il . . . a n # i " with a u . . . , a n G T, let 8w = #^+5(0) ai# ii+*(i) _ _ _ an# in+<5(n)_ 
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Before proceeding, we state two properties of these mappings with respect to the sets of 
transducers (T a ) and (T' a ). 

Lemma 3.1. We have the following properties: 

Vu,v£T*, (u,v)eT a 3S U , S v G B, (S u u, 8 v v) G T' a , (3.1) 

V(u, v) G T' a , V8 G B, 35' G B, (6u, S'v) G T' a , (3.2) 

and dually V(u, v) € T' a , G B, 35' G B, (S'u, Sv) G T' a . (3.3) 

We can now prove the correctness of the construction: a word w is accepted by G 
between / and F if and only if it is accepted by G' between /' and F' . 

Proposition 3.2. For every rational graph G and rational sets of vertices I and F, there is 
a synchronous graph G' and two rational sets I' and F' such that L(G, I, F) = L(G', I', F'). 

Proof. We show by induction on n that for all «o> • • • > u n € T*, if there is a path 

w(l) w(n) 
U —> Ui . . . U n _i —> U n , 

then there exist words u' , . . . ,u' n G (r U {#})* such that for all i, u[ = Ui, and 

/ MX) , , w(n) , 
u u l ■ ■ ■ u n-l u n- 

The case where n = is trivial. Suppose the property is true for all paths of length at most 
n, and consider a path 

ui(n) ui(n+l) 
UQ— Un —> U n+1 . 
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Figure 5: Transducers of a synchronous graph accepting {a n b n \ n > 1}. 



By induction hypothesis, one can find mappings So, . . . S n such that 

w(l) w(n) 

Oouo — ► • • • — ► o n u n . 

G> G' 

We now use the properties of mappings stated in Lemma 13,11 By there exist 5' n and 



$'n+i such that <5^u n 



to(n+l) 



^ + i"n+i € G". Let 7„ and 7^ be two elements of B such that 



/ ri w(n+l) , , 
lnK u n —> -f n+1 n+1 U n+1 



$n 7n = ° 7n- By Lemma (|3.2|) and we can find mappings 7^ +1 and 70 to 7„_i 

such that: 

U>(1) «)(n) 

7000^0 — * • • • —7 lnO n U r , 
G G 

which concludes the proof by induction. If we suppose that uq € I and u n € F, then 
necessarily u' € /' and u' n E F'. It follows that for every path in G between I and F, there 
is a path in G" between I' and -F' with the same path label. 

Conversely, by 1)3.1(1 . for any such path in G', erasing the occurrences of # from its 
vertices yields a valid path in G between I and F. Hence L(G,I,F) = L(G' , I' , F'). 



3.2. Equivalence between synchronized graphs and tiling systems 

The following propositions establish the tight relationship between tiling systems and 
synchronous rational graphs. Proposition 13.31 presents an effective transformation of a tiling 
system into a synchronous rational graph. 

Proposition 3.3. Given a tiling system S = (T, S, A), there exists a synchronous ra- 
tional graph G and two rational sets I and F such that L(G,I,F) = L(S). 

Proof. Consider the finite automaton A on T with a set of states Q = T U {#}, initial state 
#, a set of final states F and a set of transitions 8 given by: 
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Figure 6: The synchronous rational graph associated to the tiling system of Figure |2] whose 
transducers are presented in Figure 03 



Call M the language recognized by A, M represents the set of possible last columns of 
pictures of P(S). Note that this does not imply that each word of M actually is the last 
column of a picture in P(S), only that it is compatible with the right border tiles of A. 

Let us build a synchronous rational graph G and two rational sets I and F such that 
L(G,I,F) = L(S). The transitions of the set of transducers (T e ) ee s of G are: 



(#,#)-^(c,d) 

-id 



(a, 6) ^ (c, d) 

1 e 



for all 



for all 



# 


# 


c 


d 


a 


b 


c 


d 



€ A, b,d^#, e € E 



where (#, #) is the unique initial state of each transducer and the set of final states F of 
each transducer is given by: 



F : (a, 6) G (r U {#}) x T such that 



a 


b 


# 


# 



G A. 



A pair of words (s,t) is accepted by the transducer T e if and only if e is the first letter of 
t, and either s and t are two adjacent columns of a picture in -P(S') or s £ #* and i is the 
first column of a picture in P(S). As a consequence, L(5) = L(G, #*,M). □ 

Example 3.4. Figure shows the transducers obtained using the previous construction on 
the tiling system of Figure El They define a rational graph whose path language between 
and 6*_L is {a n b n \ n > 1}. Figure presents the corresponding synchronous graph whose 



vertices are the rational set of words # 



>2 



U a 



~_L + U 6 + _L + , the set of initial vertices is 



and the set of final vertices is 6 + _L. Remark that in this example, the set of vertices 
accessible from the initial vertices is rational: this is not true in the general case. 

Remark 3.5. The correspondence between a tiling system S and the synchronous graph 
G constructed from S in Proposition 13.31 is tight: each picture p with frontier w can be 
mapped to a unique accepting path for w in G (and conversely). 
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Conversely, Proposition 13. 71 states that the languages accepted by synchronous rational 
graphs between rational sets of vertices can be accepted by a tiling system. To make the 
construction simpler, we first prove that the sets of initial and final vertices can be chosen 
over a one-letter alphabet without loss of generality. 

Lemma 3.6. For every synchronous rational graph G with vertices in T* and rational sets 
I and F, one can find a synchronous rational graph H and two symbols i and f £ T such 
that L(G,I,F) = L(H,i*,f*). 

Proof. Let G = (i^ a )aes De a synchronous rational graph with vertices in T* . For i,f two 

new distinct symbols, we define a new synchronous rational graph H characterized by the 

set of transductions (T a = (Tj o K a ) U K a U (K a o 7i?)) agE where Tj = {(i n ,u) \ n > 

0, u £ I, \u\ = n} and Tp = {(v, f n ) | n > 0, v E F, \v\ = n}. For all vertices u £ I, v € F 

we have n ^ v if and only if /H, i.e. L(G, I, F) = L(H, i*, /*). □ 

G H 

We are now able to establish the converse of Proposition 13.31 which states that all the 
languages accepted by synchronous rational graphs between rational sets of vertices can be 
accepted by a tiling system. 

Proposition 3.7. Given a synchronous rational graph G and two rational sets I and F, 
there exists a tiling system S such that L(S) = L(G,I,F). 

Proof. Let G = (T a ) ag £ be a synchronous rational graph with vertices in T* (with £ C T). 
By Lemma l3.fi! we can consider without loss of generality that I = i* and F = f* for some 
distinct letters i and /, and that neither i nor / occurs in any vertex which is not in / or 
F. Furthermore by Remark 12. 11 we can assume that T a is non-ambiguous for all a £ E. 

We write Q a the set of control states of T a . We suppose that all control state sets are 
disjoint, and designate by S Q a the unique initial state of each transducer T a , and by 
Qf the set of final states of all T a . 

Let a, b,c,d 6 S, x, x' , y, y' , z, z' E T, and p,p' , q, q' , r, r', s, s' 6 Uae£ Q a - ^ e define a 
tiling system S = (r,S,#,A), where A is the set of tiles from Figure By construction, 
P(S) is in exact bijection with the set of accepting paths in G with respect to / and F. 
Let (j) be the function associating to a picture p € P(S) with columns a\Wi, . . . ,a n w n , the 
path i\ wi \ — 1 -+ vJ\ . . . w~n where w is obtained by removing the control states from w. 
By construction of S, the function (f> is well defined. It is easy to check that <f> is an onto 
function. As the transducers defining G are non- ambiguous, two distinct pictures have 
distinct images by <j) and therefore <f> is an injection. 

Hence, the tiling system (r,S,#,A) exactly recognizes L(G,I,F). □ 

Remark 3.8. As in Remark 13. 51 the set of paths in G from I to F and the set of pictures 
P(S) accepted by S are in bijection, and the length of the vertices along the path is equal 
to the height of the corresponding picture. 

Putting together Propositions 13.31 and 13.71 and Theorem 12. (i( we obtain the following 
result concerning the path languages of synchronous rational graphs. 

Theorem 3.9 ( Flis02 ). The languages accepted by synchronous rational graphs between 
rational sets of initial and final vertices are the context-sensitive languages. 

Note that this formulation of the theorem could be made a bit more precise by recalling 
that initial and final sets of vertices only of the form x* , where x is a letter, are sufficient 
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Figure 7: Tiling system accepting the language of a synchronous graph. 



to accept all context-sensitive languages, as stated in Lemma 13.61 By Proposition 13.21 this 
implies as a corollary the original result by Morvan and Stirling |MS01j . 

Corollary 3.10. The languages accepted by rational graphs between rational sets of initial 
and final vertices are the context-sensitive languages. 

If we transform a rational graph into a Turing machine by applying successively the 
construction of Proposition 13.21 Proposition 13.71 and Theorem 12.61 we obtain the same 
Turing machine as in |MS01j . 

3.3. Sequential synchronous graphs are enough 

Theorem 13.91 shows that when considering rational sets of initial and final vertices, 
synchronous graphs are enough to accept all context-sensitive languages. Interestingly, 
when considering rational sets of initial and final vertices, the even more restricted class of 
sequential synchronous transducers are sufficient. 

Proposition 3.11. The languages accepted by sequential synchronous rational graphs be- 
tween rational sets of initial and final vertices are the context-sensitive languages. 

Proof. Thanks to Proposition 13.71 it suffices to prove that any context sensitive language 
L C £* is accepted by a synchronous sequential rational graph. By Theorem 12. 6| we know 
that there exists a tiling system S = (T, E, #, A) such that L(S) = L. 

Let A = r U {#} and [ and ] be two symbols that do not belong to A. We associate to 
each picture p G A** with rows l\,...,l n the word [li] . . . [l n ]. We are going to define a set of 
sequential synchronous transducers that, when iterated, recognize the words corresponding 
to pictures in P(S). 

First, for any finite set of tiles A, we construct a transducer Ta which checks that a 
word in ([A- 3 ])- 2 represents a picture with tiles in A. The checking is done column by 
column, and we introduce marked letters to keep track of the column being checked. Let A 
be a finite alphabet in bijection with but disjoint from A. For all x G A we write x G A the 
marked version of x. For every word w = uxv G A*AA*, we write ir(w) the word uxv G A* 
and p(w) = \u\ + 1 designates the position of the marked letter in the word. 
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###### ###### ###### ###### #####£ 

#aoH# #aa66# #aa66# , #aa&6# , #aabb# 

# a 6 # # a ±± 6 # — # a ±1 6 # — # a 1_ ± 6 # — # a ±± 6 # 

#L1_1_1_# #LL±L# #L±±L# #1_1_1_L# #±±±±# 

###### ###### ###### ###### ###### 



Figure 8: Connected component of a sequential synchronous graph accepting {a n 6 n |n > 1}. 



We consider words in [A*AA*]- 2 . Let Shift be the relation that shifts all marks in a 
word one letter to the right. More precisely, Shift satisfies Dom(Shift) = ([A*AA + ])- 2 , and 
Shift([ioi] . . . [w n ]) = [w^] . . . [w' n ] with vr(^) = ir(wi) and p{w' i ) = p(wi) + 1 for all i G [1, n}. 
The rational relation Shift can be realized by a synchronous sequential transducer Tgh- 
Consider the following rational language: 



Ra 



[w n x n y n w. 



' n ]\n>2 and G [2,n], 



Xi-l 


y%-i 


Xi 


Vi 



G A 



The transducer Ta obtained by restricting Tsh to the domain i?A is both synchronous and 
sequential. For all w = [wi] . . . [w n ] G ([AAA*])- 2 , if w' = T£(w) then w' = [w[] . . . [w' n ] 
with ir(wi) = T^{w'j) and p{w' i ) = N + 2 for all i G [1, n]. Let be the word containing the 
+ 1 first letters of w[, a straightforward induction on N proves that the picture p formed 
of the rows T\ , . . . , v n only has tiles in A. In particular, T^{w) belongs to ([A*A])* n i?A if 
and only if ir(w) represent a picture p of width N + 2 such that T(p) C A. 

We now define more precisely the sequential rational graph G = (T a ) aG x; accepting 
L. For all a G S, the transducer T a is obtained by restricting the domain of Ta to 
the set of words representing pictures whose marked symbol on the second row is a, i.e. 
to the set [(A U A)*][(A*aA*][(A U A)*]*. T a can be chosen synchronous and sequential. 
The^set of initial vertices I is [###*]([#fr*#])*[###*] and the set of final vertices F is 

[#*#]([#r*#])*[#*#]. □ 

Example 3.12. Figure |H1 shows a part of the result of the previous construction when 
applied to the language {a n b n \ n > 1} as recognized by the tiling system of Figure EJ Each 
vertex is represented by the corresponding picture, instead of the word coding for it. Also, 
only one connected component of the graph is shown. The other connected components 
all have the same linear structure: the degree of the graph is bounded by 1. The leftmost 
vertex belongs to the set /, and the rightmost to the set F, hence the word a 2 b 2 is accepted. 

Remark 3.13. In the case of synchronized transducers, it has been shown in Lemma 13.61 
that / could be taken over a one letter alphabet without loss of generality. This does not 
seems to hold for sequential transducers as the proof we present relies on the expressiveness 
of the initial set of vertices. In fact, as shown in Proposition 15.31 the languages recognized 
by sequential synchronous graph from i* are deterministic context-sensitive languages. 
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4. Rational graphs seen as automata 



The structure of the graphs obtained in the previous section (propositions 13,31 and 
I3.11|) is very poor. Synchronous graphs are by definition composed of a possibly infinite 
set of finite connected components. In the case of Proposition 13.111 we obtain an even 
more restricted family of graphs since both their in-degree and out-degree is bounded by 
1. However, when considering accepted languages from a possibly infinite rational set of 
vertices, even this extremely restricted family accepts the same languages as the most 
general rational graphs, namely all context-sensitive languages. This is why, in order to 
compare the expressiveness of the different sub-families of rational graphs and to obtain 
graphs with richer structures, we need to impose structural restrictions. 

We first consider graphs with a single initial vertex, but this restriction alone is not 
enough. In fact, both synchronized and rational graphs with a rational set of initial vertices 
accept the same languages as their counterparts with a single initial vertex. 

Lemma 4.1. For every rational graph (resp. synchronized graph) G and for every pair of 
rational sets I and F, there exists a rational graph (resp. a synchronized graph) G' , a vertex 
i and a rational set F' such that L(G, I, F) = L(G', {i}, F'). 

Proof. Let G = (T a ) aG x; be a rational graph with vertices in V* and let i be a symbol which 
does not belong to T and r" = T U {i}. For all a € £, let T' a be a transducer recognizing 
the rational relation T a U {(i,w) \ w G T a (I)}. Remark that if T a is synchronized then T' a 
can also be chosen synchronized. If e L(G,I,F) then F' = F else F' = F U {i}. It is 



It follows from Proposition 13.31 and Lemma 14.11 that the synchronized rational graphs 
with one initial vertex accept the context-sensitive languages |Risf)2| . 

Remark 4.2. It is fairly obvious that this result does not hold for synchronous graphs: 
indeed, the restriction of a synchronous rational graph to the vertices reachable from a 
single vertex is finite. Hence, the languages of synchronous graphs from a single vertex are 
rational. Similarly, as any rational language is accepted by a deterministic finite graph, it 
can also be accepted by a sequential synchronous graph with a single initial vertex. 

Note that the construction of Lemma 14.11 relies on infinite out-degree to transform a 
synchronous graph with a rational set of initial vertices into a rational one with a single 
initial vertex. In order to obtain more satisfactory notions of infinite automata, we now 
restrict our attention to graphs of finite out-degree with a single initial vertex. 

4.1. Rational graphs of finite out-degree with one initial vertex. 

We present a syntactical transformation of a synchronous rational graph with a rational 
set of initial vertices into a rational graph of finite out-degree with a unique initial vertex 
accepting the same language. 

The construction relies on the fact that for a synchronous graphs to recognize a word 
of length n > 0, it is only necessary to consider vertices whose length is smaller than c n 
(where c is a constant depending only on the graph). We first establish a similar result 
for tiling systems and conclude using the close correspondence between synchronous graphs 
and tiling systems established in Proposition 13.71 




□ 
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Lemma 4.3. For any tiling system S = (T, E, Jf,A), if p G P(S) then there exists a 
(n, m)-picture p' such that fr (p) = fr {p') and n < \T\ m . 

Proof. Let p' be a (n, m)-picture with n > \T\ m , and suppose that p' is the smallest picture 
in P(S) with frontier fr (p). Let h,...,l n be the rows of p' . As n > \T\ m then there exists 
j > i > 1 such that Zj = Zj. Let p" be the picture with rows l\, . . . , Zj, lj+i, . . . ,l n . It is easy 
to check that T(p" # ) C T(p' # ), we have that p" G P (S) and as p" has a smaller height than 
p' but the same frontier, we obtain a contradiction. □ 

We know from Remark 13.81 that for every synchronous rational graph G = (T a ) aG s and 
two rational sets I and F, there exists a tiling system 5 such that i f with i G I and 

/ G F if and only if there exists p £ K such that fr (p) = w and p has height |i| = |/|. 
Hence, as a direct consequence of Lemma H31 one gets: 

Lemma 4.4. For every synchronous rational graph G and rational sets I and F, there 
exists k > 1 such that: 

\fw G L(G,I,F),3i £/,/ EF such that i f and \i\ = \ f\ < £>l 

G 

We can now construct of a rational graph of finite out-degree accepting from a single 
vertex the same language as a synchronous graph with a rational set of initial vertices. 

Proposition 4.5. For every synchronous rational graph G and rational sets I and F such 
that I n F = 0, there is a rational graph H of finite out-degree and a vertex i such that 
L(G,I,F)=L(H, {i},F). 

Proof. According to Lemma 13.61 there exists a synchronous rational graph R described by 

a set of transducers (T a ) aeS over T* such that L(G,I,F) = L(R,#*,F). Note that for all 

w G #* and w' G T*, if w — ► w' then w' does not contain #. We define a graph H such 

R 

that L(G,I,F) = L(H, {i},F) for some vertex i of i7. Let k be the constant involved in 
Lemma 14.41 T and T" two transducers realizing the rational relations {(# n ,# fcn ) | G N} 
and {#", # m | m G [1, n]} respectively. For all a, 6, c G £ and u G £*, -ff has edges: 



Vn G N, 




a 


ua\ToT(if n ) 


(Type 1) 


Vn G N, 


H# n 


a 


ua\ ToT'o T b (# n ) 


(Type 2) 


Vn G N, 


6cuj# n 


a 


ua\T' oT 6 oT c (# n ) 


(Type 3) 


Vu,G(r\ {#})% 


bcu\w 


a 


ua\ T b o T c (w) 


(Type 4) 


V«;G(r\ {#})*, 


b\w 


a 


T b o T a (w) 


(Type 5) 




\* 


a 


ToT'oT a (#) 


(Type 6) 



The graph H is clearly rational and of finite out-degree. We take i = |# as initial vertex. 
Remark that in H an edge of type 2 or 3 cannot be followed by edges of type 1, 2 or 3, and 
at most one edge of type 2 or 3 and of type 5 or 6 can be applied. Moreover, an edge of type 
1 increases the length of the left part of the word by one, and an edge of type 4 decreases it 
by one. Also, in any accepting path, the last edge is of type 5 or 6. Figure |2l illustrates the 
structure of the obtained graph. It is technical but straightforward to show a correspondence 
between accepting paths in H and R, and to conclude that L(R, #*, F) = L(H, {i}, F). □ 



CONTEXT-SENSITIVE LANGUAGES, RATIONAL GRAPHS AND DETERMINISM 



17 



(1)* 



(i)* 

^^^a|T 2 (#) ■■■■ 
••. (1)* 



cua\T b (# m - 4 ") 

a(2) I 
I 

„ci) ; 

ua\T b oT c (# m - n ) 
•(4)* 



(4)* 

(4)* 

(4)* ... 



(4) 



t a(4) I 

fecw'|«; ► ii'a|T(, o T c (u>) | 

I 

I I L 



o(5) 



Wf 

I 
I 
I 
I 
I 
I 
I 
I 
I 



(Type 1)* Type 2 or 3 (Type 4)* 

(guessing the size) (starting computation) (catching up) 



Type 5 
(accepting) 



Figure 9: Schema of the construction in Proposition 14.51 



From Proposition 13,21 and Proposition 14. 5( we deduce that the rational graphs of finite 
out-degree with one initial vertex accept all context-sensitive languages. This result was 
proved in |MS01| using the Penttonen normal form of context-sensitive grammars |Pen74j . 

Theorem 4.6. The path languages of rational graphs of finite out-degree from a unique 
initial vertex to a rational set of final vertices are the context-sensitive languages. 



4.2. Synchronized graphs of finite out-degree with one initial vertex 

We now consider the languages of synchronized graphs of finite out-degree with one 
initial vertex. First, we characterize them as the languages recognized by tiling systems 
with square pictures (i.e. for which there exists c € N such that for every word w £ L(S), 
there exists a (n, m)-picture in P(S) with n < cm and with frontier w). A slight adaptation 
of the construction of Proposition 14.51 gives the first inclusion. 

Proposition 4.7. Let S = (T, S, ff, A) be a tiling system with square pictures. There exists 
a synchronized rational graph of finite degree accepting L(S) from one initial vertex. 

Proof. Let G = (T a ) ag £ be the synchronized graph obtained from S in Proposition VA.'Al 
In the construction from the proof of Proposition 14.51 if we replace the transducer T by 
a transducer S realizing the synchronized relation {(# n , # n+c ) | n € N}, we obtain a 
synchronized graph H, a vertex i and a set F such that L(H, i, F) = L(S). □ 

Before proceeding with the converse, we state a result similar to Lemma 14.41 for syn- 
chronized graphs of finite out- degree that states that when recognizing a word w from a 
unique initial vertex i, the vertices involved have a length at most linear in the size of w. 

Lemma 4.8. For any synchronized rational graph G of finite out-degree with vertices in T* 
and for every vertex i, there exists a constant k such that for all w in L(G,{i}, F), there 
exists a path from i to some f G F, labeled by w, and with vertices of size at most k ■ \w\. 



18 



A. CARAYOL AND A. MEYER 



Proof. It follows from the definition of synchronized transducers that for every synchronized 
transducer of finite out-degree there exists c € N such that (x,y) € T implies that \x\ < \y\+c 
(sec Sak03 for a proof of this result). We take k to be the maximum over the set of 
transducers defining G of these constants. The result follows by a straightforward induction 
on the size of w. □ 

The converse inclusion is obtained by remarking that the composition of the construc- 
tion of Proposition 13 . 21 and Proposition 13.71 gives a tiling system with square pictures when 
applied to a synchronized graph of finite out-degree. 

Proposition 4.9. Let G = (T a ) a6 s be a synchronized graph of finite out-degree. For every 
initial vertex i and set of final vertices F, there exists a tiling system S with square pictures 
such that L(S) = L{G, {i},F). 

Proof. Let G' , V and F' be the synchronous graph and the rational set of initial and final 
vertices obtained by applying the constructions of Proposition 13.21 to G, {i} and F. It is 
easy to show that for every word w £ L(G' , F 1 ), there exists i' S I' and /' E F' such that 
*' ='* /' with \i'\ = \ f'\ < k\w\ where k is the constant of Lemma [4.81 for G. We conclude by 
Proposition l3.71 that states the existence of a tiling system S such that L(S) = L(G' , V , F'). 
By Remark 13, 5| S is a tiling system with square pictures. □ 

Putting together Proposition l4.9l and Proposition ^. 71 and with the use of the simulation 
result from Theorem 12. fi( we obtain the following theorem. 

Theorem 4.10. The languages accepted by synchronized graphs of finite out-degree from a 
unique vertex to a rational set of vertices are the context-sensitive languages recognized by 
non- deterministic linearly bounded machines with a linear number of head reversals. 

We conjecture that this class is strictly contained in the context-sensitive languages. 
However, few separation results exist for complexity classes defined by time and space 
restrictions (see for example |vM04| ) . In particular, the diagonalization techniques (see 
|For00j ) used to prove that the polynomial time hierarchy (with no space restriction) is 
strict do not apply for lack of a suitable notion of universal LBM. 

4.3. Bounding the out-degree 

It is natural to wonder if the rational graphs still accept the context-sensitive languages 
when considering bounded out-degree. This is a difficult question, to which we only provide 
here a partial answer concerning the synchronized graphs of bounded out-degree. 

It follows from Lemma 14.81 that the vertices used to accept a word wina synchronized 
rational graph have a length at most linear in the length of w and therefore, can be stored on 
the tape of a LBM. Moreover if the graph is deterministic, we can construct a deterministic 
LBM accepting its language. 

Proposition 4.11. The language accepted by a deterministic synchronized graph from a 
unique initial vertex is deterministic context-sensitive. 

Proof. Let G = (T a ) ag £ be a deterministic synchronized graph over V, i a vertex and F a 
rational set of vertices. We define a deterministic LBM M accepting L(G,{i}, F). When 
accepting w = a\ . . . a\ w \i M starts by writing i on its tape. It successively applies T ai , . . . , 
Ta n _ 1 and T an to i. If the image of the current tape content by one of these transducers 
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is not denned, the machine rejects. Otherwise, it checks whether the last tape content 
represents a vertex which belongs to F. 

We now detail how the machine M can apply one of the transducers T of G to a word x 
in a deterministic manner. As T has a finite image, we can assume without loss of generality 
that T = (r, Q, i, F, 5) is in real-time normal form: 5cQxTxT*xQ (see for instance 
|Ber79j for a presentation of this result). The machine enumerates all paths in T of length 
less than c\x\ in the lexicographic order where c is the constant associated to G in Lemma 
14.81 For each such path p, it checks if it is an accepting path for input x, and in that case 
replaces x by the output of p. 

The space used by M when starting with a word w is bounded by (2c+ l)|iu|. Moreover 
if M accepts w, then there exists a path from i to a vertex F in G labeled by w. Conversely, 
if w belongs to L(G, {%}, F) then by Lemma 14.81 there exists a path in G from i to F with 
vertices of length at most c\w\ and by construction M accepts w. Hence, M is a deterministic 
linearly bounded Turing machine accepting L(G, {i}, F). □ 

Remark 4.12. The result of Proposition 14. 1 ll extends to any deterministic rational graph 
satisfying the property expressed by Lemma 14.81 

The previous result can be extended to synchronized graphs of bounded out-degree 
thanks to a uniformization result by Weber. First observe that a rational graph is of out- 
degree bounded by some constant k if and only if it is defined by transducers which associate 
at most k distinct images to any input word. The relations realized by these transducers 
are called /c-valued rational relations. 

Proposition 4.13 ( [Web96| ). For any k-valued rational relation R, there exist k functional 
rational relations F\, . . . , F^ such that R = Ui<=[i fci Fi- 

Note that even if R is a synchronized relation, the Fi's are not necessarily synchronized. 
However, they still satisfy the inequality |y| < \x\ + c for all (x,y) £ Fj. 

To any synchronized graph G with an out-degree bounded by k defined by a set of trans- 
ducers (T a ) ae x;, we associate the deterministic rational graph H defined by (-Poi)oeS,ie[i,Jfc] 
where for all a € S, {F ai ) ie nu is the set of rational functions associated to T a by Proposi- 
tion According to Proposition 14. Ill and to Remark 14.121 L(H,{i},F) is a determin- 
istic context-sensitive language. Let ir be the alphabetical projection defined by 7r(aj) = a 
for all a € S and i € [1, k], it is straightforward to establish that ir (L(H, {i}, F)) = 
L(G,{i},F). As deterministic context-sensitive languages are closed under alphabetical 
projections, L(G, {i},F) is a deterministic context-sensitive language. 

Theorem 4.14. The language accepted by a synchronized graph of bounded out-degree from 
a unique initial vertex is deterministic context-sensitive. 

The converse result is not clear, for reasons similar to those presented in the previous 
section for synchronized graphs of finite degree. A precise characterization of the family of 
languages accepted by synchronized rational graphs of bounded degree would be interesting. 

5. Notions of determinism 

In this last part of the section on rational graphs, we investigate families of graphs which 
accept the deterministic context-sensitive languages. First of all, we examine the family 
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yielded by the previous constructions when applied to deterministic languages. Then, we 
propose a global property over sets of transducers characterizing a sub-family of rational 
graphs whose languages are precisely the deterministic context-sensitive languages. 

5.1. Unambiguous context-sensitive languages 

When applying the construction of Proposition 13.31 to a deterministic tiling system S, 
one obtains a synchronous rational graph G (which is non-deterministic in general) and two 
rational sets of vertices / and F such that L(G,I,F) = L(S), with the particularity that 
for every word w in L, there is exactly one path labeled by w leading from some vertex in I 
to a vertex in F: G is unambiguous with respect to I and F. However, the converse is not 
granted: given a graph G and two rational sets / and F such that G is unambiguous with 
respect to / and F, we cannot ensure that L(G, I, F) is a deterministic context-sensitive 
language. Rather, the obtained languages can be accepted by unambiguous linearly bounded 
machines. This class of languages is called USPACE(n), and it is not known whether it 
coincides with either the context-sensitive or deterministic context-sensitive languages. 

Theorem 5.1. Let L be a language, the following properties are equivalent: 

(1) L is an unambiguous context-sensitive language. 

(2) There exist a rational graph G with unambiguous transducers and two rational sets 
I and F with respect to which G is unambiguous, such that L = L(G,I,F). 

This result only holds if one considers unambiguous transducers, i.e. transducers in 
which there is at most one accepting path per pair of words. The reason is that ambiguity 
in the transducers would induce ambiguity in the machine. However, since synchronized 
transducers can be made unambiguous (Cf. B,emark l2.1j) . we can drop this requirement in 
the case of synchronized graphs. Note that the unambiguity of a rational or synchronized 
graphs with respect to rational sets of vertices is undecidable. However, since any rational 
function can be realized by an unambiguous transducer |Kob69| ISak03| . the language of 
any deterministic rational graph is, by to Theorem 15. 11 unambiguous. 

Corollary 5.2. The languages of deterministic rational graphs from an initial vertex i to 
a rational set F of vertices are unambiguous context-sensitive languages. 

5.2. Globally deterministic sets of transducers 

We just saw an attempt at characterizing natural families of graphs whose languages are 
the deterministic context-sensitive languages, which was based on a restriction of previous 
constructions to the deterministic case, but failed to meet its objective because of a slight 
nuance between the notions of determinism and unambiguity for tiling systems. 

First, we naturally consider the class of sequential synchronous automata with an initial 
set of the form {a}*, where a is a letter of the vertex alphabet (in other words, a given 
initial vertex does not code for any information besides its length). It is easy to check that 
when applying the construction of Proposition 13.71 to one of these automata, we obtain a 
deterministic tiling system. 

Proposition 5.3. The languages of sequential synchronous graphs from {a}* are determin- 
istic context-sensitive languages. 
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The converse result seems difficult to prove due to the local nature of the determinism 
involved in this class. Hence, we consider a global property of the set of transducers charac- 
terizing a rational graph, so as to ensure that each accepting path corresponds to the run of 
a deterministic linearly bounded machine on the corresponding input, or equivalently that 
each accepting path corresponds to a picture recognized by a deterministic tiling system 
and whose upper frontier is the path label under consideration. 

For any rational language L, we write Tp the minimal synchronous transducer recog- 
nizing the identity relation over L. 

Definition 5.4. Let T be a set of synchronous transducers over T. We say T is globally 
deterministic with respect to two rational languages / and F C V* if all transducers in T 
are deterministic 3 and for every pair of transducers T\ G T U {Tj} and T2 G T U {Tp}, and 
every pair of control states q± G Qt 1 and q 2 G Qt 2 j there is at most one b such that 

a/b . b/c . I . 

qi — ► q 1 A q 2 — ► q 2 for some a, c G T, q 1 G <y Tl , g 2 <G C^- 
Ti T2 

Intuitively, this condition states that, whenever a part of the output of one transducer 
can be read as input by a second transducer, there is only one way to add a letter to this word 
such that it is still compatible with both transducers. This property of sets of transducers is 
trivially decidable, since it is sufficient to check the above condition for every pair of control 
states of transducers in (T U {Tj}) x (T U {Tp}). This allows us to capture a sub-family of 
rational graphs whose languages are the deterministic context-sensitive languages. 

Theorem 5.5. Let L be a language, the following two properties are equivalent: 

(1) L is a deterministic context-sensitive language. 

(2) There is a synchronous rational graph G and two rational sets I and F such that 
L = L(G, I, F) and G is globally deterministic between I and F. 

Proof. Let G = (T a ) ag £ be a synchronous rational graph which is globally deterministic 
between / and F. The graph H = (T^) ae s obtained by applying Lemma I3~o1 to G is such 
that L(H,i*,f*) = L(G,I,F). Moreover, H is globally deterministic between i* and /*. 

We will show that the construction of Proposition 13. 71 when applied to a rational graph 
H between i* and /* yields a deterministic tiling system. Suppose that this is not the case. 
Then, by definition of a non-deterministic tiling system, there must be words u, v\ and v 2 
with vi ^ v 2 such that the two-rows pictures p± and p 2 with first row and second row 
and #^2# respectively only have tiles in A. Since v± ^ v 2 , let i be the smallest index 
such that v\{i) 7^ v 2 {i). Let v\{%) = xp, v 2 {i) = x'p' . 

By the construction of Prop. 13771 there are two transducers T a and Tf, such that 

y/x x/z 1 y/x' , x'/z' ., 

qa—^p/\qb—*q b ^q a —*P ^qb—^q b 

±a lb lb 

for some symbols y,y',z,z' G T and control states q a ,qb,q'b an d q'l- As T a is deterministic, 
if x is equal to x' , then p = p' and v\(i) = v 2 (i). Hence x 7^ x', and the above relations 
contradicts the global determinacy of H. 

To prove the converse, we introduce yet another family of acceptors for context-sensitive 
languages, namely cellular automata. A cellular automaton is a tuple (r, S, F, [,], 5) where 
r and S C r are the work and input alphabets, FCT, [, ] T and 5 is a set of 4-tuples 

3 i.e. whenever q —> q and q —* q" with q 7^ q" , it implies (a, b) 7^ (c, d). 
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over r called transition rules. These rules induce a transition relation over words of the 
form [u] G [r*]: d = [v] is a successor of c = [u] if |c| = |c'| = n and for all i E [2,n— 1], 
(c(i — l),c(i),c(i + l),c'(i)) G S. A word w is accepted if, starting from [w] one can derive 
a word [u] with -u € F*. An cellular automaton is deterministic if for all A,B,C there is 
at most one D such that (A, B, C, D) G 5. Equivalence of (deterministic) cellular automata 
with (deterministic) LBMs or tiling systems is common knowledge. 

Let L be any deterministic context-sensitive language, there exists a deterministic cel- 
lular automaton C = (T, E, _L, S, [, ]) recognizing L. One can easily build two rational 
languages / and F and a set of transducers T globally deterministic with respect to / 
and F such that L(G,I,F) = L where G is the rational graph defined by T. The work 
alphabet of T is T' = EU {[, ]} U S. The set of control states of transducer T a G T is 
{lo} U {Qab \ A, B £ T U {[}}, where q$ is the unique initial state. Its transitions are: 

Va,6GE, q a Q ^q a [a and q a ^ q a ba 

Vdi = (A, B, C, BO, da = (5, C, B, C) G <5, g£ c ^ q a B , c , 

The terminal states of T a are and Now let / = ([)* and F = SB* where R = 

{(a, b,], b') G 5 | a,b,b' G T}. By construction and since C is deterministic, T is globally 
deterministic with respect to / and F. One can easily verify that L(G, I, F) = L. □ 



6. Conclusion 

This work is a summary of new and existing results concerning rational graphs and 
their relation to context-sensitive languages. Its main contributions are, first, to show the 
language equivalence between rational graphs and synchronous rational graphs, and second 
to establish a tight connection between synchronous rational graphs and finite tiling systems. 
Since tiling systems accept precisely the context-sensitive languages, this yields a new and 
simpler proof that the languages of rational graphs coincide with this family. Thanks to 
this, we studied the impact of structural restrictions on the obtained family of languages, 
in particular when considering finite or bounded degree and a single initial vertex. 

This approach also enables us to consider the case of deterministic languages. We 
show how one can define sub-families of rational graphs whose languages are precisely the 
unambiguous or deterministic context-sensitive languages. However, due to their syntactical 
nature, these results brings little new insight as to the difficult question of the strictness of 
inclusions between deterministic, unambiguous and general context-sensitive languages. 

This presentation gives rise to a few interesting open questions. A thorough study of 
graphs of bounded degree seems necessary, albeit difficult. More generally, the question 
of knowing whether any "tractable" family of graphs accepting the context-sensitive lan- 
guages exists remains. We saw that synchronous graphs are not a good option since they 
lose all their expressive power when only a finite number of initial vertices are considered. 
Synchronized graphs form an interesting class, especially since their first order theory is 
decidable, but it seems reasonable to believe that they require infinite out-degree to accept 
all context-sensitive languages. 
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Another question is to compare the rational graphs with the transition graphs of linearly 
bounded machines [KP991 PayOO . This last point is addressed to some extent in jCM05j . 



where it is shown that all bounded degree rational graphs are isomorphic to transition 
graphs of linearly bounded machines. 
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